Methods, systems, articles of manufacture and apparatus to generate digital scenes

ABSTRACT

Methods, systems, articles of manufacture and apparatus to generate digital scenes are disclosed. An example apparatus to generate labelled models includes a map builder to generate a three-dimensional (3D) model of an input image, a grouping classifier to identify a first zone of the 3D model corresponding to a first type of grouping classification, a human model builder to generate a quantity of placeholder human models corresponding to the first zone, a coordinate engine to assign the quantity of placeholder human models to respective coordinate locations of the first zone, the respective coordinate locations assigned based on the first type of grouping classification, a model characteristics modifier to assign characteristics associated with an aspect type to respective ones of the quantity of placeholder human models, and an annotation manager to associate the assigned characteristics as label data for respective ones of the quantity of placeholder human models.

CROSS-REFERENCE TO RELATED APPLICATION

This patent application arises from a 371 nationalization of PCT PatentApplication No. PCT/EP2019/057961, entitled “METHODS, SYSTEMS, ARTICLESOF MANUFACTURE AND APPARATUS TO GENERATE DIGITAL SCENES,” and which wasfiled on Mar. 28, 2019, which claims priority to U.S. Provisional PatentApplication Ser. No. 62/650,722, which was filed on Mar. 30, 2018. PCTPatent Application No. PCT/EP2019/057961 and U.S. Provisional PatentApplication Ser. No. 62/650,722 are hereby incorporated herein byreference in their entireties. Priority to PCT Patent Application No.PCT/EP2019/057961 and U.S. Provisional Patent Application Ser. No.62/650,722 is hereby claimed.

FIELD OF THE DISCLOSURE

This disclosure relates generally to network training, and, moreparticularly, to methods, systems, articles of manufacture and apparatusto generate digital scenes.

BACKGROUND

In recent years, neural networks have been utilized to aid in theanalysis of scenes. In some examples, neural networks aid in theidentification and/or classification of emerging patterns, such aspatterns related to crowd behavior.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic illustration of an example digital scene generatorto generate a digital scene with labelled human models.

FIG. 2 is a schematic illustration of an example zone map analyzed bythe example digital scene generator of FIG. 1.

FIGS. 3-7 are flowcharts representative of machine readable instructionswhich may be executed to implement the example digital scene generatorof FIG. 1 to generate digital scenes with labelled human models.

FIG. 8 is a block diagram of an example processing platform structuredto execute the instructions of FIGS. 3-7 to implement the exampledigital scene generator of FIG. 1.

The figures are not to scale. In general, the same reference numberswill be used throughout the drawing(s) and accompanying writtendescription to refer to the same or like parts.

Descriptors “first,” “second,” “third,” etc. are used herein whenidentifying multiple elements or components which may be referred toseparately. Unless otherwise specified or understood based on theircontext of use, such descriptors are not intended to impute any meaningof priority or ordering in time but merely as labels for referring tomultiple elements or components separately for ease of understanding thedisclosed examples. In some examples, the descriptor “first” may be usedto refer to an element in the detailed description, while the sameelement may be referred to in a claim with a different descriptor suchas “second” or “third.” In such instances, it should be understood thatsuch descriptors are used merely for ease of referencing multipleelements or components.

DETAILED DESCRIPTION

Crowd estimation has a wide range of applications, which includecomputer vision applications, robotics applications and securitysurveillance applications. Computer vision techniques and deep learningtechnologies have enabled large scale crowd estimation, but advances insuch techniques have been hindered by a lack of high quality, annotated(labelled) and publicly available datasets. Tasks related to crowdcounting, crowd segmentation and crowd tracking may be assisted withconvolutional neural network (CNN) techniques, but such techniquesrequire a large amount of labelled data to perform well. While manypublicly-placed cameras obtain images (e.g., high resolution images) ofscenes having large amounts of people, such images require tediouslabelling tasks to aid with one or more machine learning tasks (e.g.,those involving CNN techniques). Additionally, even if the people withinthose publicly-available scenes were labelled, some privacy laws blockthe release of real-world footage for private and/or public use. Forinstance, crowd videos captured from surveillance cameras (e.g.,closed-captioned television (CCTV)) are seldom released for public usein view of General Data Protection Regulation (GDPR) legislation, underwhich release forms are required from individuals whose personalinformation is captured in an image.

In the event crowd data is available for which all privacy regulationsare met and tedious labelling efforts have occurred, such crowd data istypically limited to a particular scene. Stated differently, a varietyof scenes on which to perform machine learning analysis on crowds is notreadily available, thus crowd research managers have limitedflexibility. Furthermore, crowd labelling efforts are typicallyperformed by disparate teams of human beings. Typically, human-basedlabelling efforts suffer from bias, inaccuracy and fatigue. Moreover,multiple different human annotators lead to inconsistencies within thelabelled data, which may lead to ground truth reliance issues.

Examples disclosed herein generate photo-realistic scalable labelledsynthetic crowds and/or synthetic images including crowds for thepurpose of, in part, accelerating crowd understanding techniques andmachine learning efforts. In some examples, human models are generatedon scene reconstructed environments. The human models are tailored tosatisfy design requirements (e.g., quantity and type of gender, walking,running, face orientation, clothing types, etc.) and correspondinglabelling characteristics are retained and/or otherwise associated witheach human model without human annotator participation. In someexamples, the scene reconstructed environments are created with footagecaptured by aerial drone surveys, and then crowds are composited withoriginal images to generate photo-realistic data.

FIG. 1 is a schematic illustration of an example digital scene generator102 to enable generating digital scenes with labelled human models. Inthe illustrated example of FIG. 1, the digital scene generator 102includes an example map builder 104, an example human model builder 106,and an example coordinate engine 108. The example coordinate engine 108of FIG. 1 includes an example zone selector 110 and an example groupingclassifier 112. The example digital scene generator 102 also includes anexample model aspect manager 114, which includes an example metadatamanager 116 and an example model characteristic modifier 118. Theexample digital scene generator 102 also includes an examplephotorealism adjustor 126, which includes an example image datadeterminer 128, an example profile adjustor 130, an example shadowadjustor 132, an example depth-of-field adjustor 134, and an examplenoise adjustor 136. The example digital scene generator 102 alsoincludes an example annotation manager 138 and an example transformationengine 140. In this example, the map builder 104 implements means formap building. The means for map building may additionally oralternatively be implemented by map building means. In this example, thehuman model builder 106 implements means for human model building. Themeans for human model building may additionally or alternatively beimplemented by human model building means. In this example, thecoordinate engine 108 implements means for coordinate assigning. Themeans for coordinate assigning may additionally or alternatively beimplemented by coordinate assigning means. In this example, the zoneselector 110 implements means for zone selecting. The means for zoneselecting may additionally or alternatively be implemented by zoneselecting means. In this example, the grouping classifier 112 implementsmeans for grouping. The means for grouping may additionally oralternatively be implemented by grouping means. In this example, themodel aspect manager 114 implements means for aspect modification. Themeans for aspect modification may additionally or alternatively beimplemented by aspect modification means. In this example, the metadatamanager 116 implements means for managing metadata. The means formanaging metadata may additionally or alternatively be implemented bymetadata managing means. In this example, the model characteristicmodifier 118 implements means for model characteristics modification.The means for model characteristics modification may additionally oralternatively be implemented by model characteristics modificationmeans. In this example, the photorealism adjustor 126 implements meansfor photorealism adjusting. The means for photorealism adjusting mayadditionally or alternatively be implemented by photorealism adjustingmeans. In this example, the image data determiner 128 implements meansfor image data determining. The means for image data determining mayadditionally or alternatively be implemented by image data determiningmeans. In this example, the profile adjustor 130 implements means forprofile adjusting. The means for profile adjusting may additionally oralternatively be implemented by profile adjusting means. In thisexample, the shadow adjustor 132 implements means for shadow adjusting.The means for shadow adjusting may additionally or alternatively beimplemented by shadow adjusting means. In this example, thedepth-of-field adjustor 134 implements means for depth-of-fieldadjusting. The means for depth-of-field adjusting may additionally oralternatively be implemented by depth-of-field adjusting means. In thisexample, the noise adjustor 136 implements means for noise adjusting.The means for noise adjusting may additionally or alternatively beimplemented by noise adjusting means. In this example, the annotationmanager 138 implements means for annotation. The means for annotationmay additionally or alternatively be implemented by annotation means. Inthis example, the transformation engine 140 implements means fortransformation. The means for transformation may additionally oralternatively be implemented by transformation means.

In operation, the example map builder 104 generates a model from aninput image. The example input image is sometimes referred to as abackground scene. The input image may include one or more images (e.g.,stereoscopic images) from different angles and/or points of view, suchas one or more images from video footage of a scene of interest (e.g., apark, a school campus, etc.). Such video footage may originate fromaerial drone data capture, in which three-dimensional (3D) models areconstructed (e.g., as a 3D mesh) via one or more motion methodologytechniques. 3D models allow a coordinate mapping reference of the sceneof interest, in which the coordinate mapping provides an ability tomeasure, calculate and/or otherwise identify distance informationbetween one or more objects of the scene of interest (e.g., distancevalues associated with a length of a road, a height of a building,dimensions of greenspace, etc.). While the example 3D model generated bythe example map builder 104 is based on the one or more images of thescene of interest, the 3D model (e.g., the 3D mesh) includes one or moreobject file formats, and is represented as a geolocated point cloud. Insome examples, the generated point cloud includes discrete geolocatedcoordinate points of different aspects of the scene of interest, such ascoordinate points indicative of road boundaries, building boundaries,etc. In some examples, the 3D object file format is supported by one ormore rendering applications, such as Blender®, Cycles Renderer®, etc. Assuch, the geolocated point cloud may not include shading, coloringand/or other imagery that is typically associated with photorealismcharacteristics suitable for machine learning and/or training. Examplesdisclosed herein insert human models into the 3D model and, to improveone or more photorealism characteristics in a manner suitable formachine learning, overlay the scene of interest (e.g., originallycaptured image(s)) onto the 3D model, as described in further detailbelow.

In some examples, captured images from the scene of interest includelighting conditions unique to a particular sun position (e.g., ifoutdoors) or no sun position at all. In such examples, the capturedimages may also include shadows for one or more objects within the sceneof interest (e.g., shadows caused by buildings, shadows caused by trees,shadows caused by vehicles, etc.). As described in further detail below,in an effort to improve a degree of photorealism for machine learning,examples disclosed herein insert human models into the 3D model in amanner consistent with lighting conditions associated with the originalscene of interest.

The example map builder 104 retrieves 3D model input (e.g., from a user)to cause insertion of human models into the scene of interest. In someexamples, the map builder 104 retrieves zone information indicative ofdifferent zones of the scene of interest that are to include placementof human models subject to different distance constraints. Map/modelinput may originate from user input on the 3D model in the form ofpainting, highlighting and/or otherwise identifying particularregions/zones of the 3D model with particular colors, in which eachcolor represents a type of human model characteristic (e.g., particulartypes of grouping classifications). For example, some zones of the 3Dmap correspond to a first grouping classification (e.g., a particularzone was painted a particular color corresponding to the first groupingclassification), while other zones correspond to a second groupingclassification. In some examples, the map builder 104 obtains such mapinput zone details in conjunction with a particular quantity of humanmodels that are to be inserted in the respective zone (e.g., insert arelatively low quantity of human models in a first zone associated witha lawn picnic, insert a relatively high quantity of human models in asecond zone associated with a concert, etc.). An example first groupingclassification includes a co-dependent placement grouping type, in whichhuman models to be inserted into the first zone have a relationship withadjacent human models of that zone. Examples of the co-dependentplacement grouping type classification include groups of people having apicnic (e.g., on a greenspace zone of the scene of interest), groups ofpeople watching a concert, groups of people marching in a parade, etc.Different grouping classification types (e.g., the co-dependentplacement grouping type) include different threshold distances betweenadjacent members in that zone, such as adjacent members relatively closeto one another during a picnic, a concert, a parade, etc. Alternatively,an independent placement grouping type is indicative of human models tobe inserted into a zone (e.g., a second zone) that do not have arelationship with adjacent human models. For example, people(represented by human models on the 3D model) walking along a sidewalkor sitting alone on a park bench are not behaving in a coordinatedmanner with other people that might be adjacent to them in thatparticular zone. As such, a relative distance between adjacent humanmodels associated with the independent grouping type may be greater thansuch distances observed between human models in the co-dependentgrouping type.

FIG. 2 is an example zone map 200 analyzed by the example map builder104 when retrieving, receiving and/or otherwise obtaining input and/orinstructions regarding human model placement. In the illustrated exampleof FIG. 2, the zone map 200 includes a first greenspace area 202, asecond greenspace area 204, and a walking path 206. The example walkingpath 206 includes a first region of interest (ROI) 208 indicative of azone associated with the independent grouping classification. Theexample first ROI 208 may have been identified by a prior user paintingeffort on the example zone map 200. As such, placement of human modelsin the first ROI 208 exhibits a first distance threshold valuetherebetween. Such first distance threshold placements of the humanmodels more accurately represent individual people behaving in a mannerindependent of one another, such as individuals walking to work, walkingto a class, etc. In some examples, placement of human models associatedwith the independent grouping classification will also exhibit a poseorientation associated with an expected direction of travel, such as apose orientation parallel to the example walking path 206. Some humanmodels may exhibit a pose 180 degrees opposite of one or more adjacenthuman models to represent people moving in different directions alongthe walking path 206.

In the illustrated example of FIG. 2, the second greenspace area 204includes a second ROI 210 indicative of a zone associated with theco-dependent grouping classification. As such, human models placed inthe second ROI 210 exhibit a second distance threshold valuetherebetween. Such second distance threshold placements of the humanmodels more accurately represent groups of people that have a degree ofinteraction, such as people attending a concert, having a picnic orwalking in a parade. Typically, such people will have a relativelysmaller distance therebetween, particularly in very crowded concerts,parades, etc. In some examples, placement of human models associatedwith the co-dependent grouping classification will also exhibit a poseorientation toward a similar focal point (e.g., a center of a stageduring a concert), or a focal point central to a group of people in agenerally circular formation (e.g., people facing each other during apicnic). Examples disclosed herein enable the acquisition of suchdesired zones, particular grouping classifications and/or particularpose details. Using such acquired information from the example zone map200, human model placement is performed in a more realistic manner,thereby improving one or more machine learning training operations.

The example zone selector 110 selects a particular zone of interest fromthe 3D model (e.g., the example zone map 200 of FIG. 2), and the examplehuman model builder 106 generates a number of placeholder human modelsbased on a target quantity for the selected zone. As used herein,placeholder human models represent human models that are not yettailored to include particular characteristics of an aspect type. Insome examples, placeholder human models include coordinate locations ofhuman models placed on respective coordinate positions of the 3D model(e.g., 3D mesh) of the scene of interest, in which such placeholderhuman models are devoid of the aspect characteristics. In some examples,the respective coordinate positions of the human models are selected ina random manner to improve a degree of photorealism. As used herein, anaspect type represents a particular characteristic grouping of a humanmodel. Example aspect types include, but are not limited to a raceaspect, a gender aspect, an age aspect, a height aspect, a muscleaspect, a weight aspect, a pose aspect, a movement-type aspect (e.g.,walking, running, sifting, etc.), or a clothing aspect. As used herein,a characteristic represents a particular aspect detail. For example,characteristics of the gender aspect may include, but are not limited toa female gender or a male gender. Example characteristics of the ageaspect may include an integer value of an age of the human model, andexample characteristics of the pose aspect include, but are not limitedto an orientation, a standing pose, a sitting pose, a walking pose or arunning pose.

The example grouping classifier 112 determines if a selected zone ofinterest (e.g., a zone from the example zone map 200) is associated witha co-dependent grouping classification. If so, the example coordinateengine 108 assigns respective placeholder human models to respectivecoordinate locations of the 3D model corresponding to constraints of thegrouping classification type. In some examples, the coordinate engine108 assigns the human model to a corresponding coordinate location andalso aligns a facial orientation of the human model based on a referencefocal point. For instance, if a number of human models corresponding tothe co-dependent placement grouping classification are to simulatewatching a concert, then the example coordinate engine 108 establishesan orientation of each human model facing the reference focal point ofinterest (e.g., a center of a stage in the example 3D model). In someexamples, the coordinate engine 108 applies a variation factor to eachplaced human model so that the facial orientation (e.g., a directionalorientation) is not too homogenous and/or artificial in appearance. Insome examples, the coordinate engine 108 arranges each human modelwithin a threshold orientation+/−x-degrees as determined by a randomnumber generator. When all placeholder human models of a selected zoneof interest have been assigned to corresponding coordinate locations ofthe 3D model, the example zone selector 110 determines whether theexample 3D model includes one or more additional zones of interest thathave not yet been processed for human model placement. In the eventadditional zones and/or human models are to be placed on the 3D model,the aforementioned process repeats accordingly. In some examples, aquantity of human models is placed on the 3D model based on user inputinformation, such as a number of people to represent at a concert, anumber of people to represent at a picnic, or a number of people torepresent walking down a sidewalk/street.

After the example 3D model includes assignments of human models tocorresponding coordinate locations, the example model aspect manager 114assigns characteristics to the human models. The example model aspectmanager 114 selects a human model from the 3D model, and the examplemetadata manager 116 extracts metadata associated with the selectedhuman model. For example, metadata associated with respective humanmodels may include, but are not limited to the grouping classificationtype (e.g., co-dependent, independent), an activity type (e.g., watchinga concert, walking to work), with which zone the human model isassociated, etc. While example human models may have certain metadataassociated with the corresponding grouping classification type,additional metadata is added by examples disclosed herein to serve aslabel information that assists machine learning activities. The examplemodel characteristic modifier 118 selects a candidate model aspect type.As described above, aspect types may include but are not limited to arace aspect, a gender aspect, an age aspect, a height aspect, a muscleaspect, a weight aspect, a pose aspect, a movement-type aspect (e.g.,walking, running, sitting, etc.), or a clothing aspect.

The example model characteristic modifier 118 assigns a characteristicof the selected aspect type (e.g., “male” is selected from the aspecttype “gender”), and the example metadata manager 116 adds, associatesand/or otherwise stores the characteristic value (e.g., “male”) asmetadata to the selected human model (e.g., the human models may bestored within a data structure). In some examples, the example modelcharacteristic modifier 118 utilizes a random number generator torandomly select a characteristic from each aspect type of interest. Instill other examples, each candidate characteristic may be paired withco-characteristics that are typically deemed mutually exclusive, such asa male versus female characteristic, a pants versus shortscharacteristic, a facial hair versus non-facial hair characteristic,etc. The example metadata manager 116 thus provides an equal chance ofeach particular characteristic being represented in a stochastic manner.As such, photorealism is improved for machine learning training byeliminating bias or skew caused by homogeneous characteristic selection.The example model characteristic modifier 118 determines whether thereare one or more additional aspect types of interest to be considered forthe selected human model and, if so, selects a next unexplored aspecttype in a similar manner. When all possible aspect types havecorresponding characteristics assigned to the human model, the examplemodel aspect manager 114 selects a next human model for characteristicassignment.

While the example 3D model has a number of synthetically generated humanmodels placed thereon at particular geographic coordinates, examplesdisclosed herein apply additional modifications to the human models toimprove a degree of photorealism. The example photorealism adjustor 126invokes one or more changes to the 3D model and/or human models therein.The example image data determiner 128 retrieves, receives and/orotherwise obtains characteristics associated with the scene of interest,which may include any number of images acquired from aerial dronephotography tasks. In some examples, the image data determiner 128retrieves frustum settings associated with the camera(s) responsible forthe scene of interest. In some examples, the image data determiner 128retrieves lighting information associated with respective images of thescene of interest, such as respective locations of the sun when theimage(s) was taken, locations of light in a room, etc. Generallyspeaking, information related to source locations of light aid in thegeneration of realistic shadows for the human models such that shadowsof those human models are consistent with respect to shadows captured bythe camera(s) (e.g., shadows of buildings, trees, etc.) when taking thesource image(s).

The example profile adjustor 130 adjusts the human model profiles basedon source camera angle information. For instance, examples disclosedherein adjust the coordinate scaling of the human models based on sourcecamera angle information so that the human models appear consistent withscene objects (e.g., trees, buildings, etc.). The example shadowadjustor 132 applies shadows to the human models based on shadowcharacteristics of the originally captured images of the scene ofinterest. Typically, because the captured images of the scene ofinterest are taken at a particular time of day, lighting conditionscause variations in shadows of objects within the scene. Accordingly,examples disclosed herein apply shadows to the human models in a mannerconsistent with shadows appearing on objects of the scene. The exampledepth of field adjustor 134 adjusts a depth-of-field characteristic ofrespective human models based on a desired focal point of the scene. Theexample noise adjustor 136 applies noise in an effort to improvephotorealism. In some examples, one or more crowds are adjusted toappear out of focus to create a more photo-realistic image. In someexamples, pixilation is applied by the noise adjustor 136 on top ofparticular human models to remove smooth edges that occur as a result ofsynthetic rendering processes.

The example map builder 104 overlays the original images of the scene ofinterest over the 3D model (3D mesh) to create a synthetic image. As aresult, a degree of photorealism of the synthetic image is improvedbecause actual images of the scene of interest are used instead ofrendering wireframe/mesh data of the 3D model. However, while thesynthetic image now includes (a) actual images from the scene ofinterest overlaid upon (b) human models having characteristic metadataand 3D coordinate information (e.g., x-axis, y-axis, z-axis,projection-view data, model-view data), machine learning operationsrequire two-dimensional (2D) coordinate information with associatedlabel information. In some examples, the map builder 104 applies awatermark to the synthetic image to identify that all human imagesand/or human faces are simulated. Such watermarking may proactivelysubdue any concerns that the images are in violation of one or morejurisdictional laws/rules related to privacy (e.g., General DataProtection Regulation (GDPR)—Regulation (EU) 2016/679).

Examples disclosed herein facilitate scalable techniques to performannotation tasks with pixel level accuracy. At least one advantage ofusing synthetic human model data (e.g., the human models generated bythe example human model builder 106) and 3D models is the access topositional and dimensional information of all objects within the scene.Pixel co-ordinates of each character model in 2D space is obtained whenrendered onto an image (e.g., a 3D projection). Example transformationsare performed by the example transformation engine 140 in a mannerconsistent with example Equation 1.P′=P*M(x,y,z,1)′   Equation 1.In the illustrated example of Equation 1, P and M represent a projectionand model-view matrix, respectively. The point P′ is normalized so thatit can be mapped back (by the example transformation engine 140) ontothe rendered synthetic map/image. Camera parameters are modelled similarto specifications found on industry-standard cameras, such as theexample DJI Phantom 3 drone camera. Such images render from the modelusing a 94 degree field of view and 20 mm sensors. In view of varyingheights of models, an annotated point is set by the exampletransformation engine 140 to a center of a body of interest. For eachrendered human model, an accompanying pixel coordinate is provided bythe example transformation engine 140 in vector form (U, V form, where Ureflects a horizontal 2D dimension and V reflects a vertical 2Ddimension) and written to, for example, a text file.

Stated differently, the example annotation manager 138 invokes theexample model aspect manager 114 to select a human model from the 3Dmodel (e.g., the synthetic image), and the example coordinate engine 108extracts 3D coordinate information. The example transformation engine140 transforms the 3D coordinate information to a 2D coordinaterepresentation, as described above in a manner consistent with exampleEquation 1. The example metadata manager 116 embeds metadata to the 2Dcoordinate information as label data, which includes characteristics ofdifferent aspect types, activity types and/or grouping classificationinformation. The example transformation engine 140 maps the human modelto the 2D coordinate (e.g., U, V and characteristic label information)on the synthetic image. This process is repeated for any number of humanmodels on the synthetic image such that subsequent machine learningoperations may proceed in view of the scene of interest.

While an example manner of implementing the digital scene generator 102of FIG. 1 is illustrated in FIGS. 1 and 2, one or more of the elements,processes and/or devices illustrated in FIG. 1 may be combined, divided,re-arranged, omitted, eliminated and/or implemented in any other way.Further, the example map builder 104, the example human model builder106, the example coordinate engine 108, the example zone selector 110,the example grouping classifier 112, the example model aspect manager114, the example metadata manager 116, the example model characteristicmodifier 118, the example photorealism adjustor 126, the example imagedata determiner 128, the example profile adjustor 130, the exampleshadow adjustor 132, the example depth-of-field adjustor 134, theexample noise generator 136, the example annotation manager 138, theexample transformation engine 140 and/or, more generally, the exampledigital scene generator 102 of FIG. 1 may be implemented by hardware,software, firmware and/or any combination of hardware, software and/orfirmware. Thus, for example, any of the example map builder 104, theexample human model builder 106, the example coordinate engine 108, theexample zone selector 110, the example grouping classifier 112, theexample model aspect manager 114, the example metadata manager 116, theexample model characteristic modifier 118, the example photorealismadjustor 126, the example image data determiner 128, the example profileadjustor 130, the example shadow adjustor 132, the exampledepth-of-field adjustor 134, the example noise generator 136, theexample annotation manager 138, the example transformation engine 140and/or, more generally, the example digital scene generator 102 of FIG.1 could be implemented by one or more analog or digital circuit(s),logic circuits, programmable processor(s), programmable controller(s),graphics processing unit(s) (GPU(s)), digital signal processor(s)(DSP(s)), application specific integrated circuit(s) (ASIC(s)),programmable logic device(s) (PLD(s)) and/or field programmable logicdevice(s) (FPLD(s)). When reading any of the apparatus or system claimsof this patent to cover a purely software and/or firmwareimplementation, at least one of the example map builder 104, the examplehuman model builder 106, the example coordinate engine 108, the examplezone selector 110, the example grouping classifier 112, the examplemodel aspect manager 114, the example metadata manager 116, the examplemodel characteristic modifier 118, the example photorealism adjustor126, the example image data determiner 128, the example profile adjustor130, the example shadow adjustor 132, the example depth-of-fieldadjustor 134, the example noise generator 136, the example annotationmanager 138, the example transformation engine 140 and/or, moregenerally, the example digital scene generator 102 of FIG. 1 is/arehereby expressly defined to include a non-transitory computer readablestorage device or storage disk such as a memory, a digital versatiledisk (DVD), a compact disk (CD), a Blu-ray disk, etc. including thesoftware and/or firmware. Further still, the example digital scenegenerator 102 of FIG. 1 may include one or more elements, processesand/or devices in addition to, or instead of, those illustrated in FIGS.1 and/or 2, and/or may include more than one of any or all of theillustrated elements, processes and devices. As used herein, the phrase“in communication,” including variations thereof, encompasses directcommunication and/or indirect communication through one or moreintermediary components, and does not require direct physical (e.g.,wired) communication and/or constant communication, but ratheradditionally includes selective communication at periodic intervals,scheduled intervals, aperiodic intervals, and/or one-time events.

Flowcharts representative of example hardware logic, machine readableinstructions, hardware implemented state machines, and/or anycombination thereof for implementing the digital scene generator 102 ofFIG. 1 are shown in FIGS. 3-7. The machine readable instructions may beone or more executable programs or portion(s) of an executable programfor execution by a computer processor such as the processor 812 shown inthe example processor platform 800 discussed below in connection withFIG. 8. The program(s) may be embodied in software stored on anon-transitory computer readable storage medium such as a CD-ROM, afloppy disk, a hard drive, a DVD, a Blu-ray disk, or a memory associatedwith the processor 812, but the entire program(s) and/or parts thereofcould alternatively be executed by a device other than the processor 812and/or embodied in firmware or dedicated hardware. Further, although theexample program(s) is/are described with reference to the flowchartsillustrated in FIGS. 3-7, many other methods of implementing the exampledigital scene generator 102 may alternatively be used. For example, theorder of execution of the blocks may be changed, and/or some of theblocks described may be changed, eliminated, or combined. Additionallyor alternatively, any or all of the blocks may be implemented by one ormore hardware circuits (e.g., discrete and/or integrated analog and/ordigital circuitry, an FPGA, an ASIC, a comparator, anoperational-amplifier (op-amp), a logic circuit, etc.) structured toperform the corresponding operation without executing software orfirmware.

The machine readable instructions described herein may be stored in oneor more of a compressed format, an encrypted format, a fragmentedformat, a packaged format, etc. Machine readable instructions asdescribed herein may be stored as data (e.g., portions of instructions,code, representations of code, etc.) that may be utilized to create,manufacture, and/or produce machine executable instructions. Forexample, the machine readable instructions may be fragmented and storedon one or more storage devices and/or computing devices (e.g., servers).The machine readable instructions may require one or more ofinstallation, modification, adaptation, updating, combining,supplementing, configuring, decryption, decompression, unpacking,distribution, reassignment, etc. in order to make them directly readableand/or executable by a computing device and/or other machine. Forexample, the machine readable instructions may be stored in multipleparts, which are individually compressed, encrypted, and stored onseparate computing devices, wherein the parts when decrypted,decompressed, and combined form a set of executable instructions thatimplement a program such as that described herein. In another example,the machine readable instructions may be stored in a state in which theymay be read by a computer, but require addition of a library (e.g., adynamic link library (DLL)), a software development kit (SDK), anapplication programming interface (API), etc. in order to execute theinstructions on a particular computing device or other device. Inanother example, the machine readable instructions may need to beconfigured (e.g., settings stored, data input, network addressesrecorded, etc.) before the machine readable instructions and/or thecorresponding program(s) can be executed in whole or in part. Thus, thedisclosed machine readable instructions and/or corresponding program(s)are intended to encompass such machine readable instructions and/orprogram(s) regardless of the particular format or state of the machinereadable instructions and/or program(s) when stored or otherwise at restor in transit.

As mentioned above, the example processes of FIGS. 3-7 may beimplemented using executable instructions (e.g., computer and/or machinereadable instructions) stored on a non-transitory computer and/ormachine readable medium such as a hard disk drive, a flash memory, aread-only memory, a compact disk, a digital versatile disk, a cache, arandom-access memory and/or any other storage device or storage disk inwhich information is stored for any duration (e.g., for extended timeperiods, permanently, for brief instances, for temporarily buffering,and/or for caching of the information). As used herein, the termnon-transitory computer readable medium is expressly defined to includeany type of computer readable storage device and/or storage disk and toexclude propagating signals and to exclude transmission media.

“Including” and “comprising” (and all forms and tenses thereof) are usedherein to be open ended terms. Thus, whenever a claim employs any formof “include” or “comprise” (e.g., comprises, includes, comprising,including, having, etc.) as a preamble or within a claim recitation ofany kind, it is to be understood that additional elements, terms, etc.may be present without falling outside the scope of the correspondingclaim or recitation. As used herein, when the phrase “at least” is usedas the transition term in, for example, a preamble of a claim, it isopen-ended in the same manner as the term “comprising” and “including”are open ended. The term “and/or” when used, for example, in a form suchas A, B, and/or C refers to any combination or subset of A, B, C such as(1) A alone, (2) B alone, (3) C alone, (4) A with B, (5) A with C, (6) Bwith C, and (7) A with B and with C. As used herein in the context ofdescribing structures, components, items, objects and/or things, thephrase “at least one of A and B” is intended to refer to implementationsincluding any of (1) at least one A, (2) at least one B, and (3) atleast one A and at least one B. Similarly, as used herein in the contextof describing structures, components, items, objects and/or things, thephrase “at least one of A or B” is intended to refer to implementationsincluding any of (1) at least one A, (2) at least one B, and (3) atleast one A and at least one B. As used herein in the context ofdescribing the performance or execution of processes, instructions,actions, activities and/or steps, the phrase “at least one of A and B”is intended to refer to implementations including any of (1) at leastone A, (2) at least one B, and (3) at least one A and at least one B.Similarly, as used herein in the context of describing the performanceor execution of processes, instructions, actions, activities and/orsteps, the phrase “at least one of A or B” is intended to refer toimplementations including any of (1) at least one A, (2) at least one B,and (3) at least one A and at least one B.

The program 300 of FIG. 3 includes block 302, where the example mapbuilder 104 generates a 3D model from acquired images of interest. Theexample map builder 104 retrieves, receives and/or otherwise obtainsinput data associated with the generated 3D model regarding human modelplacements (block 304). As described above, one or more painted regionsof the generated 3D model (e.g., a 3D mesh wireframe) are identified bythe example map builder 104 to determine a particular groupingclassification for placeholder human models. Such groupingclassifications, in some examples, are organized on a zone-by-zonebasis.

The example coordinate engine 108 assigns coordinates to human models(candidate human models that do not yet have one or more aspectcharacteristics associated therewith) (block 306). However, after anumber of human models are added to the 3D model (block 306), theexample model aspect manager 114 assigns characteristics to those humanmodels (block 308). The example photorealism adjustor 126 applies one ormore photorealism adjustments to the human models (block 310), and theexample map builder 104 overlays the originally acquired image(s) on the3D model to create a synthetic map/image of the scene of interest (block312). With human models on the synthetic map in their respectiveassigned coordinates, the example annotation manager 138 annotates thehuman models (block 314).

FIG. 4 illustrates additional detail of the example coordinate engine108 assigning coordinates to the human models (block 306). In theillustrated example of FIG. 4, the example zone selector 110 selects oneof the zones of interest from the 3D map (e.g., the example zone map 200of FIG. 2) (block 402). Based on a type of the selected zone, theexample human model builder 106 generates placeholder human models basedon a target quantity associated with the selected zone (block 404). Asdescribed above, candidate human models are generated by the examplehuman model builder 106 in a manner devoid of one or morecharacteristics so that such characteristics (e.g., attributes) can beselected in a more realistic manner (e.g., application of a statisticalrandomness to improve realistic crowd variety). The example groupingclassifier 112 determines whether the selected zone and correspondingplaceholder human models are associated with a co-dependent placementgrouping classification (block 406). If so, the example coordinateengine 108 assigns the one of the human models to a coordinate locationin the 3D model in a manner corresponding to the co-dependent placementgrouping classification (block 408). The example grouping classifier 112determines whether all placeholder human models for the co-dependentplacement grouping classification have been assigned to a correspondinglocation and, if not (block 410), the human model builder 106 selects anext placeholder human model (block 412). Control then returns to block408.

In the event the example grouping classifier 112 determines that theselected zone is not associated with the co-dependent placement groupingclassification (block 406), then the example coordinate engine 108assigns one of the human models to a corresponding coordinate locationin the 3D model corresponding to an independent placement groupingclassification (block 414). The example grouping classifier 112determines whether all placeholder human models associated with theindependent placement grouping classification have been assigned (block416) and if not, the human model builder 106 selects a next one of theplaceholder human models (block 418). Control returns to block 414 toassign the selected placeholder human model a corresponding coordinatelocation based on the independent placement grouping classification.When all of the candidate placeholder human models have been processed(see blocks 410 and 416), the example zone selector 110 determineswhether all zones of the 3D model have been processed (block 420). Ifnot, control returns to block 402 to select another zone of interest.

FIG. 5 illustrates additional detail associated with efforts by theexample model aspect manager 114 to assign characteristics to humanmodels (block 308). In the illustrated example of FIG. 5, the examplemodel aspect manager 114 selects one of the placed candidate humanmodels from the 3D map (block 502). The example metadata manager 116extracts any available metadata associated with the selected human model(block 504), and the model characteristic modifier 118 selects acandidate model aspect type (block 506). As described above, each aspecttype of interest may have any number of corresponding characteristicsthat define human models. The model characteristic modifier 118 assignsa characteristic of the selected aspect type of interest to the humanmodel (block 508). In some examples, the characteristic assignment isperformed with the aid of a random number generator so that variouscharacteristics of human models exhibit a degree of variation that ismore similar to crowd diversification expectations. The example metadatamanager 116 adds the assigned characteristic to metadata associated withthe selected human model (block 510), and the example modelcharacteristic modifier 118 determines whether there are additionalaspect types of interest to be considered when assigning characteristicsto the selected human model (block 512). If so, then control returns toblock 506, otherwise the example model aspect manager 114 determineswhether there are additional human models to which characteristicsshould be assigned (block 514). If so, then control returns to block502, otherwise the example program 308 of FIG. 5 returns to block 310 ofFIG. 3.

FIG. 6 includes additional detail corresponding to efforts by theexample photorealism adjustor 126 to apply photorealism characteristicsto the human models and/or 3D model (block 310). In the illustratedexample of FIG. 6, the example image data determiner 128 retrieves imagecharacteristics associated with the original image(s) of the scene ofinterest (block 602). The example profile adjustor 130 adjusts a profilesetting(s) of human models based on camera angle information of theretrieved images characteristics (block 604). The example shadowadjustor 132 applies shadows to the human models based on lightingcondition information (block 606), if any. In some examples the lightingcondition information corresponds to a position of the sun when theoriginal image of the scene of interest was taken, thereby allowing aproper shadow creation/adjustment by the example shadow adjustor 132.The example depth of field adjustor 134 adjusts depth-of-fieldcharacteristics of the human models (block 608) based on, for example,camera focal point information, and the example noise adjustor 136applies various noise artifacts to the human models to improve a degreeof photorealism (block 610). Control then returns to block 312 of FIG. 3where the example map builder 104 overlays the originally capturedimage(s) on the 3D model.

FIG. 7 illustrates additional detail corresponding to efforts of theexample annotation manager 138 to annotate the human models (block 314).In the illustrated example of FIG. 7, the example model aspect manager114 selects one of the human models from the 3D model/map (block 702).As described above, all of the inserted human models are now associatedwith coordinate information (e.g., cartesian x, y, z coordinates,rotational positioning coordinates) and characteristic information(e.g., particular characteristics associated with different aspecttypes). The example coordinate engine 108 extracts such 3D coordinateinformation (block 704), and the example transformation engine 140transforms the 3D location coordinates into 2D coordinates (block 706).As described above, the example transformation engine 140 may normalizethe 3D coordinates in a manner consistent with example Equation 1. Anymetadata (e.g., characteristics data) previously associated with thehuman models of the 3D human model are embedded by the example metadatamanager 116 to the 2D location coordinates (block 708), which may beused as label data for machine learning purposes. Additionally, theexample transformation engine 140 maps the human model to the syntheticmap with references to the 2D location coordinates (block 710). Theexample model aspect manager 114 determines whether there are additionalhuman models in the 3D map (block 712) and, if so, control returns toblock 702 to select a next available human model for 2D transformation.

FIG. 8 is a block diagram of an example processor platform 800structured to execute the to implement the digital scene generator 102of FIG. 2. The processor platform 800 can be, for example, a server, apersonal computer, a workstation, a self-learning machine (e.g., aneural network), a mobile device (e.g., a cell phone, a smart phone, atablet such as an iPad), a personal digital assistant (PDA), an Internetappliance, a gaming console, a personal video recorder, a set top box, aheadset or other wearable device, or any other type of computing device.

The processor platform 800 of the illustrated example includes aprocessor 812. The processor 812 of the illustrated example is hardware.For example, the processor 812 can be implemented by one or moreintegrated circuits, logic circuits, microprocessors, GPUs, DSPs, orcontrollers from any desired family or manufacturer. The hardwareprocessor may be a semiconductor based (e.g., silicon based) device. Inthis example, the processor implements the example map builder 104, theexample human model builder 106, the example coordinate engine 108, theexample zone selector 110, the example grouping classifier 112, theexample model aspect manager 114, the example metadata manager 116, theexample model characteristic modifier 118, the example photorealismadjustor 126, the example image data determiner 128, the example profileadjustor 130, the example shadow adjustor 132, the example depth offield adjustor 134, the example noise adjustor 136, the exampleannotation manager 138, the example transformation engine, and theexample digital scene generator 102.

The processor 812 of the illustrated example includes a local memory 813(e.g., a cache). The processor 812 of the illustrated example is incommunication with a main memory including a volatile memory 814 and anon-volatile memory 816 via a bus 818. The volatile memory 814 may beimplemented by Synchronous Dynamic Random Access Memory (SDRAM), DynamicRandom Access Memory (DRAM), RAMBUS® Dynamic Random Access Memory(RDRAM®) and/or any other type of random access memory device. Thenon-volatile memory 816 may be implemented by flash memory and/or anyother desired type of memory device. Access to the main memory 814, 816is controlled by a memory controller.

The processor platform 800 of the illustrated example also includes aninterface circuit 820. The interface circuit 820 may be implemented byany type of interface standard, such as an Ethernet interface, auniversal serial bus (USB), a Bluetooth® interface, a near fieldcommunication (NFC) interface, and/or a PCI express interface.

In the illustrated example, one or more input devices 822 are connectedto the interface circuit 820. The input device(s) 822 permit(s) a userto enter data and/or commands into the processor 812. The inputdevice(s) can be implemented by, for example, an audio sensor, amicrophone, a camera (still or video), a keyboard, a button, a mouse, atouchscreen, a track-pad, a trackball, isopoint and/or a voicerecognition system.

One or more output devices 824 are also connected to the interfacecircuit 820 of the illustrated example. The output devices 824 can beimplemented, for example, by display devices (e.g., a light emittingdiode (LED), an organic light emitting diode (OLED), a liquid crystaldisplay (LCD), a cathode ray tube display (CRT), an in-place switching(IPS) display, a touchscreen, etc.), a tactile output device, a printerand/or speaker. The interface circuit 820 of the illustrated example,thus, typically includes a graphics driver card, a graphics driver chipand/or a graphics driver processor.

The interface circuit 820 of the illustrated example also includes acommunication device such as a transmitter, a receiver, a transceiver, amodem, a residential gateway, a wireless access point, and/or a networkinterface to facilitate exchange of data with external machines (e.g.,computing devices of any kind) via a network 826. The communication canbe via, for example, an Ethernet connection, a digital subscriber line(DSL) connection, a telephone line connection, a coaxial cable system, asatellite system, a line-of-site wireless system, a cellular telephonesystem, etc.

The processor platform 800 of the illustrated example also includes oneor more mass storage devices 828 for storing software and/or data.Examples of such mass storage devices 828 include floppy disk drives,hard drive disks, compact disk drives, Blu-ray disk drives, redundantarray of independent disks (RAID) systems, and digital versatile disk(DVD) drives.

The machine executable instructions 832 of FIGS. 3-7 may be stored inthe mass storage device 828, in the volatile memory 814, in thenon-volatile memory 816, and/or on a removable non-transitory computerreadable storage medium such as a CD or DVD.

From the foregoing, it will be appreciated that example methods,apparatus and articles of manufacture have been disclosed that avoidprivacy issues typically associated with image data acquired from publicsources (e.g., CCTV cameras) when attempting to amass large quantitiesof crowd data for machine learning purposes. Additionally, examplesdisclosed herein prevent errors caused by human discretion whenannotating and/or otherwise labelling items for use by machine learningtraining operations.

Example methods, apparatus, systems, and articles of manufacture togenerate digital scenes are disclosed herein. Further examples andcombinations thereof include the following:

Example 1 includes an apparatus to generate labelled models, theapparatus comprising a map builder to generate a three-dimensional (3D)model of an input image, a grouping classifier to identify a first zoneof the 3D model corresponding to a first type of groupingclassification, a human model builder to generate a quantity ofplaceholder human models corresponding to the first zone, a coordinateengine to assign the quantity of placeholder human models to respectivecoordinate locations of the first zone, the respective coordinatelocations assigned based on the first type of grouping classification, amodel characteristics modifier to assign characteristics associated withan aspect type to respective ones of the quantity of placeholder humanmodels, and an annotation manager to associate the assignedcharacteristics as label data for respective ones of the quantity ofplaceholder human models.

Example 2 includes the apparatus as defined in example 1, wherein themap builder is to generate a geolocated point cloud of coordinate pointsas the 3D model of the input image.

Example 3 includes the apparatus as defined in example 1, wherein thegrouping classifier is to interpret a painted region of the 3D modelbased on a color of the first zone, the color indicative of the firsttype of grouping classification.

Example 4 includes the apparatus as defined in example 1, wherein thefirst type of grouping classification corresponds to a first thresholddistance between respective ones of the quantity of placeholder humanmodels in the first zone.

Example 5 includes the apparatus as defined in example 4, wherein thegrouping classifier is to identify a second zone of the 3D modelcorresponding to a second type of grouping classification, the secondtype of grouping classification corresponding to a second thresholddistance between respective ones of a second quantity of placeholderhuman models in the second zone.

Example 6 includes the apparatus as defined in example 1, wherein thecoordinate engine is to assign the quantity of placeholder human modelsin a manner devoid of the characteristics associated with the aspecttype.

Example 7 includes the apparatus as defined in example 1, wherein thecoordinate engine is to assign a directional orientation to therespective ones of the placeholder human models based on the first typeof grouping classification.

Example 8 includes a non-transitory computer readable medium comprisingcomputer readable instructions that, when executed, cause at least oneprocessor to generate a three-dimensional (3D) model of an input image,identify a first zone of the 3D model corresponding to a first type ofgrouping classification, generate a quantity of placeholder human modelscorresponding to the first zone, assign the quantity of placeholderhuman models to respective coordinate locations of the first zone, therespective coordinate locations assigned based on the first type ofgrouping classification, assign characteristics associated with anaspect type to respective ones of the quantity of placeholder humanmodels, and associate the assigned characteristics as label data forrespective ones of the quantity of placeholder human models.

Example 9 includes the computer readable medium as defined in example 8,wherein the instructions, when executed, cause the at least oneprocessor to generate a geolocated point cloud of coordinate points asthe 3D model of the input image.

Example 10 includes the computer readable medium as defined in example8, wherein the instructions, when executed, cause the at least oneprocessor to interpret a painted region of the 3D model based on a colorof the first zone, the color indicative of the first type of groupingclassification.

Example 11 includes the computer readable medium as defined in example8, wherein the instructions, when executed, cause the at least oneprocessor to assign a first threshold distance between respective onesof the quantity of placeholder human models in the first zone based onthe first type of grouping classification.

Example 12 includes the computer readable medium as defined in example11, wherein the instructions, when executed, cause the at least oneprocessor to identify a second zone of the 3D model corresponding to asecond type of grouping classification, the second type of groupingclassification corresponding to a second threshold distance betweenrespective ones of a second quantity of placeholder human models in thesecond zone.

Example 13 includes the computer readable medium as defined in example8, wherein the instructions, when executed, cause the at least oneprocessor to assign the quantity of placeholder human models in a mannerdevoid of the characteristics associated with the aspect type.

Example 14 includes the computer readable medium as defined in example8, wherein the instructions, when executed, cause the at least oneprocessor to assign a directional orientation to the respective ones ofthe placeholder human models based on the first type of groupingclassification.

Example 15 includes a computer implemented method to generate labelledmodels, the method comprising generating, by executing an instructionwith at least one processor, a three-dimensional (3D) model of an inputimage, identifying, by executing an instruction with the at least oneprocessor, a first zone of the 3D model corresponding to a first type ofgrouping classification, generating, by executing an instruction withthe at least one processor, a quantity of placeholder human modelscorresponding to the first zone, assigning, by executing an instructionwith the at least one processor, the quantity of placeholder humanmodels to respective coordinate locations of the first zone, therespective coordinate locations assigned based on the first type ofgrouping classification, assigning, by executing an instruction with theat least one processor, characteristics associated with an aspect typeto respective ones of the quantity of placeholder human models, andassociating, by executing an instruction with the at least oneprocessor, the assigned characteristics as label data for respectiveones of the quantity of placeholder human models.

Example 16 includes the method as defined in example 15, furtherincluding generating a geolocated point cloud of coordinate points asthe 3D model of the input image.

Example 17 includes the method as defined in example 15, furtherincluding interpreting a painted region of the 3D model based on a colorof the first zone, the color indicative of the first type of groupingclassification.

Example 18 includes the method as defined in example 15, wherein thefirst type of grouping classification corresponds to a first thresholddistance between respective ones of the quantity of placeholder humanmodels in the first zone.

Example 19 includes the method as defined in example 18, furtherincluding identifying a second zone of the 3D model corresponding to asecond type of grouping classification, the second type of groupingclassification corresponding to a second threshold distance betweenrespective ones of a second quantity of placeholder human models in thesecond zone.

Example 20 includes the method as defined in example 15, furtherincluding assigning the quantity of placeholder human models in a mannerdevoid of the characteristics associated with the aspect type.

Example 21 includes the method as defined in example 15, furtherincluding assigning a directional orientation to the respective ones ofthe placeholder human models based on the first type of groupingclassification.

Example 22 includes an apparatus to generate labelled models, theapparatus comprising means for map building to generate athree-dimensional (3D) model of an input image, means for grouping toidentify a first zone of the 3D model corresponding to a first type ofgrouping classification, means for human model building to generate aquantity of placeholder human models corresponding to the first zone,means for coordinate assigning to assign the quantity of placeholderhuman models to respective coordinate locations of the first zone, therespective coordinate locations assigned based on the first type ofgrouping classification, means for model characteristics modification toassign characteristics associated with an aspect type to respective onesof the quantity of placeholder human models, and means for annotation toassociate the assigned characteristics as label data for respective onesof the quantity of placeholder human models.

Example 23 includes the apparatus as defined in example 22, wherein themap building means is to generate a geolocated point cloud of coordinatepoints as the 3D model of the input image.

Example 24 includes the apparatus as defined in example 22, wherein thegrouping means is to interpret a painted region of the 3D model based ona color of the first zone, the color indicative of the first type ofgrouping classification.

Example 25 includes the apparatus as defined in example 22, wherein thefirst type of grouping classification corresponds to a first thresholddistance between respective ones of the quantity of placeholder humanmodels in the first zone.

Example 26 includes the apparatus as defined in example 25, wherein thegrouping means is to identify a second zone of the 3D modelcorresponding to a second type of grouping classification, the secondtype of grouping classification corresponding to a second thresholddistance between respective ones of a second quantity of placeholderhuman models in the second zone.

Example 27 includes the apparatus as defined in example 22, wherein thecoordinate assigning means is to assign the quantity of placeholderhuman models in a manner devoid of the characteristics associated withthe aspect type.

Example 28 includes the apparatus as defined in example 22, wherein thecoordinate assigning means is to assign a directional orientation to therespective ones of the placeholder human models based on the first typeof grouping classification.

Although certain example methods, apparatus and articles of manufacturehave been disclosed herein, the scope of coverage of this patent is notlimited thereto. On the contrary, this patent covers all methods,apparatus and articles of manufacture fairly falling within the scope ofthe claims of this patent.

What is claimed is:
 1. An apparatus to generate labelled models, theapparatus comprising: at least one memory; instructions in theapparatus; and processor circuitry to execute the instructions toinstantiate: a map builder to generate a three-dimensional (3D) model ofan input image; a grouping classifier to identify a first zone of the 3Dmodel corresponding to a first type of grouping classification; a humanmodel builder to generate a quantity of placeholder human modelscorresponding to the first zone; a coordinate engine to: assign thequantity of placeholder human models to respective coordinate locationsof the first zone, the respective coordinate locations assigned based onthe first type of grouping classification; and align a facialorientation when the first type of grouping classification correspondsto a co-dependency between the quantity of placeholder human models, thefacial orientation of respective ones of the placeholder human models tobe aligned based on a variation factor; a model characteristics modifierto assign characteristics associated with an aspect type to respectiveones of the quantity of placeholder human models; and an annotationmanager to associate the assigned characteristics as label data forrespective ones of the quantity of placeholder human models.
 2. Theapparatus as defined in claim 1, wherein the map builder is to generatea geolocated point cloud of coordinate points as the 3D model of theinput image.
 3. The apparatus as defined in claim 1, wherein the firsttype of grouping classification corresponds to a first thresholddistance between respective ones of the quantity of placeholder humanmodels in the first zone.
 4. The apparatus as defined in claim 3,wherein the grouping classifier is to identify a second zone of the 3Dmodel corresponding to a second type of grouping classification, thesecond type of grouping classification corresponding to a secondthreshold distance between respective ones of a second quantity ofplaceholder human models in the second zone.
 5. The apparatus as definedin claim 1, wherein the coordinate engine is to assign the quantity ofplaceholder human models in a manner devoid of the characteristicsassociated with the aspect type.
 6. The apparatus as defined in claim 1,wherein the processing circuitry is to execute the instructions toinstantiate a random number generator to calculate the variation factorbased on a threshold orientation value corresponding to a commonreference focal point.
 7. A non-transitory computer readable mediumcomprising computer readable instructions that, when executed, cause atleast one processor to: generate a three-dimensional (3D) model of aninput image; identify a first zone of the 3D model corresponding to afirst type of grouping classification; generate a quantity ofplaceholder human models corresponding to the first zone; assign thequantity of placeholder human models to respective coordinate locationsof the first zone, the respective coordinate locations assigned based onthe first type of grouping classification; align a facial orientationwhen the first type of grouping classification corresponds to aco-dependency between the quantity of placeholder human models, thefacial orientation of respective ones of the placeholder human models tobe aligned based on a variation factor; assign characteristicsassociated with an aspect type to respective ones of the quantity ofplaceholder human models; and associate the assigned characteristics aslabel data for respective ones of the quantity of placeholder humanmodels.
 8. The computer readable medium as defined in claim 7, whereinthe instructions, when executed, cause the at least one processor togenerate a geolocated point cloud of coordinate points as the 3D modelof the input image.
 9. The computer readable medium as defined in claim7, wherein the instructions, when executed, cause the at least oneprocessor to assign a first threshold distance between respective onesof the quantity of placeholder human models in the first zone based onthe first type of grouping classification.
 10. The computer readablemedium as defined in claim 9, wherein the instructions, when executed,cause the at least one processor to identify a second zone of the 3Dmodel corresponding to a second type of grouping classification, thesecond type of grouping classification corresponding to a secondthreshold distance between respective ones of a second quantity ofplaceholder human models in the second zone.
 11. The computer readablemedium as defined in claim 7, wherein the instructions, when executed,cause the at least one processor to assign the quantity of placeholderhuman models in a manner devoid of the characteristics associated withthe aspect type.
 12. The computer readable medium as defined in claim 7,wherein the instructions, when executed, cause the at least oneprocessor to calculate the variation factor based on a thresholdorientation value corresponding to a common reference focal point.
 13. Acomputer implemented method to generate labelled models, the methodcomprising: generating, by executing an instruction with at least oneprocessor, a three-dimensional (3D) model of an input image;identifying, by executing an instruction with the at least oneprocessor, a first zone of the 3D model corresponding to a first type ofgrouping classification; generating, by executing an instruction withthe at least one processor, a quantity of placeholder human modelscorresponding to the first zone; assigning, by executing an instructionwith the at least one processor, the quantity of placeholder humanmodels to respective coordinate locations of the first zone, therespective coordinate locations assigned based on the first type ofgrouping classification; aligning, by executing an instruction with theat least one processor, a facial orientation when the first type ofgrouping classification corresponds to a co-dependency between thequantity of placeholder human models, the facial orientation ofrespective ones of the placeholder human models to be aligned based on avariation factor; assigning, by executing an instruction with the atleast one processor, characteristics associated with an aspect type torespective ones of the quantity of placeholder human models; andassociating, by executing an instruction with the at least oneprocessor, the assigned characteristics as label data for respectiveones of the quantity of placeholder human models.
 14. The method asdefined in claim 13, further including generating a geolocated pointcloud of coordinate points as the 3D model of the input image.
 15. Themethod as defined in claim 13, wherein the first type of groupingclassification corresponds to a first threshold distance betweenrespective ones of the quantity of placeholder human models in the firstzone.
 16. The method as defined in claim 15, further includingidentifying a second zone of the 3D model corresponding to a second typeof grouping classification, the second type of grouping classificationcorresponding to a second threshold distance between respective ones ofa second quantity of placeholder human models in the second zone. 17.The method as defined in claim 13, further including assigning thequantity of placeholder human models in a manner devoid of thecharacteristics associated with the aspect type.
 18. The method asdefined in claim 13, further including calculating the variation factorbased on a threshold orientation value corresponding to a commonreference focal point.
 19. An apparatus to generate labelled models, theapparatus comprising: means for map building to generate athree-dimensional (3D) model of an input image; means for grouping toidentify a first zone of the 3D model corresponding to a first type ofgrouping classification; means for human model building to generate aquantity of placeholder human models corresponding to the first zone;means for coordinate assigning to: assign the quantity of placeholderhuman models to respective coordinate locations of the first zone, therespective coordinate locations assigned based on the first type ofgrouping classification; and align a facial orientation when the firsttype of grouping classification corresponds to a co-dependency betweenthe quantity of placeholder human models, the facial orientation ofrespective ones of the placeholder human models to be aligned based on avariation factor; means for model characteristics modification to assigncharacteristics associated with an aspect type to respective ones of thequantity of placeholder human models; and means for annotation toassociate the assigned characteristics as label data for respective onesof the quantity of placeholder human models.
 20. The apparatus asdefined in claim 19, wherein the map building means is to generate ageolocated point cloud of coordinate points as the 3D model of the inputimage.
 21. The apparatus as defined in claim 19, wherein the first typeof grouping classification corresponds to a first threshold distancebetween respective ones of the quantity of placeholder human models inthe first zone.
 22. The apparatus as defined in claim 21, wherein thegrouping means is to identify a second zone of the 3D modelcorresponding to a second type of grouping classification, the secondtype of grouping classification corresponding to a second thresholddistance between respective ones of a second quantity of placeholderhuman models in the second zone.
 23. The apparatus as defined in claim19, wherein the coordinate assigning means is to assign the quantity ofplaceholder human models in a manner devoid of the characteristicsassociated with the aspect type.
 24. The apparatus as defined in claim19, wherein the means for coordinate assigning is to calculate thevariation factor based on a threshold orientation value corresponding toa common reference focal point.